Your student number:22076842

Module code and title:INST0065: Data Visualization and GIS 22/23

Module tutor’s name:Dr Oliver Duke-Williams

The title of the work:Data Visualization Project: Explore the Gender Patterns Behind Plaques

Electronic word count:1980

Introduction

Blue heritage plaques pepper the UK landscape expounding officially validated narratives celebrating past events, people, and buildings.They support concise, easily accessible stories of significant past achievements in the present(Mel, 2015). However, despite their widespread presence and impact on public memory, these plaques have been criticized for perpetuating a narrow and often gendered representation of history. In this assignment, I aim to explore the gender narratives behind plaques by analyzing the data of male and female subjects across different categories and time periods.

The data used in this project is from Open Plaques(openplaques.org, 2023), which is a community-based project that seeks to document and promote commemorative plaques and historical markers. The project makes data available openly and provides mechanisms for improving those data by means of comments on individual records(Nicole, 2012).

Methods

Preprocess

Before proceeding to data visualization, there are some necessary preparations that need to be done. For example, I set several options for code chunks to make the final output cleaner and more focused on the main content, load the necessary packages, read in the plaque data from the CSV file, and select a subset of variables that will be used for visualization, saving them in plaques_subset dataframe.

Visualization One

In the first visualization, I plan to investigate how the number of plaques for male and female subjects, have changed over time. In this process, I filtered the data to include only plaques erected after 1800 because the data before that is too few to reflect the patterns of the era and contain obvious errors that could be disruptive to the analysis. (for example, there are erected years recorded as ‘1’). Also, for the convenience of plotting, I divided the erected data into ten-year intervals using floor(erected/10)*10). Then, the data were grouped and summarized by decade. Also, I used pivot_longer() function to change the data format, allowing for easier visualization, and the mutate() function to rename the variables. The process of getting lead subject data is similar, apart from I used the count() function to compute the number of plaques by decade and lead subject sex.

In the visualization process, ggplot was used to create a composite chart that includes both a bar chart and a line chart to display the data for subjects and lead subjects. The use of two colours, orange and purple, allows for easy differentiation between male and female data.

Visualization Two

Next, I tried to explore the distribution of the roles of commemorated individuals across genders and time periods. I first selected ten of the most frequently occurring roles for males and females respectively using the slice_max() function and focused on them instead of including all roles in the following process. I did this selection because there are more than one thousand roles involved in the data, making it difficult to visualize and identify patterns. Then I used the right_join() function to keep only the rows that contain those top roles and then counted the number of occurrences of each role in each decade and gender. After getting the needed data, I made a scatter plot to show the distribution of lead subject primary roles across time, grouped by gender. The bubble size represents the number of plaques with that specific combination of decade and role, and the colour distinguishes between male and female subjects. Additionally, plotly::ggplotly is used to convert the ggplot chart to a plotly chart, which allows users to hover over data points to see more detailed information, pan and zoom to focus on specific areas of the graph. I add this because the size of the bubble may not accurately reflect subtle differences.

Visualization Three

The third figure serves as a complement to visualization two. The second figure clearly shows the change in the number of each lead subject role over time, but it is not easy to visually compare the number of different roles within each decade. Therefore, in the third plot, I used a stacked bar chart to show the proportion of different lead subject roles within each decade by setting stat = ‘identity’, and position = ‘fill’. Also, I used scale_fill_brewer() function that sets the colour palette for the fill aesthetic. This function is used to ensure that the fill colours of the bars match across the different roles in the plot, making it easier for viewers to compare the proportions of each role. At the same time, I split the data for males and females into two separate plots, more detailedly displaying the role distribution of each gender in each decade.

Visualization Four

In this last figure, I aimed to explore whether the number of plaques, is related to the position in which they are located. To achieve this, I created a geographical plot of the distribution of plaques in the UK. Firstly, I used ‘uk_map = map_data (map = ’world’, region = ‘UK’)’ to retrieve the geographical data of the world map and select only the data for the UK region, storing them in the uk_map dataframe. Then, I used the filter() function to select only the plaques that fall within the geographical bounds of the UK. This was done by comparing the plaque’s longitude and latitude values with the minimum and maximum values of the longitude and latitude in the uk_map data.

In the visualization process, I used geom_polygon() to draw the UK map using the uk_map data, setting the fill colour of the map to light grey and the outline colour to black. Then, I used geom_point() to plot the plaque data onto the map. The plaque’s longitude and latitude values are plotted on the x and y axes, respectively, and the colour of the point is determined by the plaque’s lead subject sex. I manually set the points to orange for the female and purple for the male.

Results

From the early 19th century to the early 21st century, there has always been a significant gap in the number of male and female plaques, both in terms of the subjects included in the plaque and the lead subject. As seen from Visualization One, the number of male plaques has been growing rapidly each year since 1950, with the number of male lead subjects increasing more than tenfold from 96 in 1960 to 967 in 2010, while the growth of female plaques has been much slower. In the year with the highest number of plaques, which was 2010, the number of plaques with female lead subject was only 236. This disparity in the number of male and female plaques to some extend reflects that women’s value and contributions have not been given the same level of recognition as men. In other words, faced with structural and cultural barriers that have historically held women back, women may not have had the same opportunities and environment as men to attain the kinds of accomplishments that lead to recognition. Further confirmation of this can be found in the distribution of roles.

As shown in visualization two and three, roles such as writer, author, poet, and actor have a relatively even gender distribution, taking up high proportion in both sexes. However, some professions show great gender difference, such as architect, engineer, Fellow of the Royal Society, and Member of Parliament. The number of men with these roles is much higher than that of women. Conversely, there are some roles that are predominantly female, such as nurse and singer. These differences can be explained by historical and systemic biases, as well as traditional gender roles and social expectations.

From the temporal perspective, from 1800 to 1950, roles related to literature such as poets, novelists, and writers were highly represented by both men and women. However, after 1950, other professions such as actors, engineers, and architects grew at a faster rate and even surpassed the growth of the earlier professions. This is closely related to the stage of social and economic development.

It also deserves mentioning that Suffragettes and Suffragists are among the top ten roles for women. If having a detailed look, we can find the time period in which these women lived, it is mostly in the second half of the 19th century and into the 20th century, and the years of the plaques’ erection are concentrated in the 20th and 21st centuries. This is consistent with the rise and development of the modern feminist movement, reflecting the awakening of female consciousness and the beginning of women’s struggle for their rightful rights. Feminists and women’s rights activists began to be regarded as worthy of commemoration.

Finally, from the geographical plot of the distribution of plaques in the UK, once again we can visually see the difference in the number of male and female plaques: the density of male plaques is clearly much higher than that of female plaques. At the same time, the density of plaques in the south and central regions of the UK is relatively high, while the density is low in Scotland. Also, the density is higher in large cities such as London, Birmingham, and Manchester. However, the map does not show a significant regional gender difference, in other words, there is not a strong correlation between the region and lead subject sex.

Conclusion

In conclusion, the Open Plaques data provides us with a valuable window into the social preferences and cognitive evaluations that underlie the historical markers in the UK. The gender disparity of opaque number indicates a gender bias in the representation of historical figures. And the difference in the roles attributed to male and female subjects also reveals how the traditional gender roles and stereotypes have influenced the selection of individuals to be commemorated in the public sphere. However, as women’s status continues to rise and gender consciousness awakens, I am willing to see women participate in the public sphere in more diverse roles, truly be seen, and gain respect and recognition for their achievement.

Lastly, there are another idea I would like to implement and improve if given more time. In visualization two and three, I adopted the erected time as the time variable to analyze role distribution for I hope to see the social value and expectations of those erection eras. To be specific, what kind of people they deem worthy of being remembered and celebrated during that time period. However, I also want to make a change by looking at the timeline of these commemorated individuals, which can reveal the gender-based division of roles during their lifetime. This thought can be achieved because there are two variables that show the born and death year of lead subjects, but I am not sure how to show the lifespan clearly. Since every person has their own lifeline and there is overlap in time, it is difficult to prevent these lines from overlapping in the graph, and it is also difficult to distinguish the beginning and end of the lines. The following is a chart I attempted to create, but it doesn’t effectively convey the information, so I need to explore more in the future.

Bibliography

Mel, W. (2015). The Blue Plaque:Co-creating Design Fictions in the Wild. Retrieved from https://figshare.com/articles/journal_contribution/RTD2015_30_The_Blue_Plaque_Co-Creating_Design_Fictions_in_the_Wild/1328000
Nicole, B. (2012). How community archaeology can make use of open data to achieve further its objectives. Routledge. Retrieved from https://www.tandfonline.com/doi/abs/10.1080/00438243.2012.743252
openplaques.org. (2023). openplaques.org database. Retrieved from https://openplaques.org/data/